Data Mining of Gene Expression Microarray via Weighted Prefix Trees
نویسندگان
چکیده
We used discrete combinatoric methods and non numerical algorithms [9], based on weighted prefix trees, to examine the data mining of DNA microarray data, in order to capture biological or medical informations and extract new knowledge from these data. We describe hierarchical cluster analysis of DNA microarray data using structure of weighted trees in two manners : classifying the degree of overlap between different microarrays and classifying the degree of expression levels between different genes. These are most efficiently done by finding the characteristic genes and microarrays with the maximum degree of overlap and determining the group of candidate genes suggestive of a pathology.
منابع مشابه
Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملMining Accurate Shared Decision Trees from Microarray Gene Expression Data for Different Cancers
This paper studies the problem of mining shared decision trees across multiple application domains, including multiple microarray gene expression datasets for different cancers. Shared knowledge structures capture similarity between application domains and have many useful applications. Given two datasets with classes, we focus on shared decision trees that are highly accurate in both datasets ...
متن کاملGene Expression Profiling of DNA Microarray Data using Peano Count Trees (P-Trees)
The explosion of genomic data made possible by advances in parallel, high-throughput technologies in the area of molecular biology, has ushered in a new era in the area of Bioinformatics. During the last many years, efforts concentrated on sequencing the genome of organisms. Current emphasis lies in extracting meaningful information from this huge DNA sequence and expression data. The technique...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملGene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method
Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005